IN THE CLAIMS: 



Kindly replace the claims with the following: 

1 . (Currently Amended) A dialog system (1) comprising processing units for 

- automatic speech recognition (3), 

- natural language understanding (4), 

defining system outputs in dependence on information (7) derived from user inputs, 
generating acoustic and/or visual system outputs (9, 10, 11, 12), 
deriving user models (22, 25), whil e the us e r mod e ls (22, 25) contain from 
determined details about the style of speech of user inputs and/or details about 
interactions in dialogs between users and the dialog system (1) and 
adaptation of contents and/or form of system outputs is provided in dependence on 
the derived user models (22, 25). 

2. (Original) A dialog system as claimed in claim 1, 
characterized 

in that in addition to the input modality to use user inputs by means of speech, at least a 
further input modality is provided and 

in that the user models (22, 25) contain details about the respective use of the various 
input modalities by the user. 

3. (Previously Presented) A dialog system as claimed in claim 1, 
characterized 

in that the user models (22, 25) contain estimates for the reliability of recognition results 
derived from user inputs. 

4. (Original) A dialog system as claimed in claim 3, 
characterized 

in that in dependence on the estimates, system responses are generated which prompt the 
respective user to use such input modalities for which high estimate values were 
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determined and/or which prevent the respective user from using input modalities for 
which low reliability values were determined. 

5. (Previously Presented) A dialog system as claimed in one of the claims 1, 
characterized 

in that fixed models of user stereotypes (22) are used for forming the user models. 

6. (Previously Presented) A dialog system as claimed in one of the claims 1, 
characterized 

in that user models (25) are used which are continuously updated based on inputs of the 
respective user. 

7. (Currently amended) A method of operating a dialog system, in which processing units 
are used for 

- automatic speech recognition (3), 

- natural language understanding (4), 

- defining system outputs in dependence on information (7) derived from user inputs, 

- generating acoustic and/or visual system outputs (9, 10, 11, 12), and 

- deriving user models (13), from 

whil e th e us e r mod e ls contain details about the style of speech of user inputs and/or 
indications about interactions in dialogs between users and the dialog system (1) and 
an adaptation of adapting contents and/or form of system outputs is provid e d in 
dependence on the user models (22, 25). 

8. (Currently amended) A process for television-user dialog, comprising the steps of: 
receiving user speech input; 

processing the speech input using automatic speech recognition and natural language 
understanding; and 

defining at least one system output based on the speech input and a user model derived 
from details of the user inputs . 



3 



9. (Previously Presented) The process of claim 8, wherein the step of defining comprises 
the step of: 

defining at least one system output based on the speech input and a user 
model which includes an experience level, wherein the system output is based on the 
experience level of the user model in that if the experience level is low, the system output 
is a first length, while if the experience level is high, the system output is a second length 
lesser than the first length. 

10 (Previously Presented) The process of claim 8, wherein the step of defining comprises 
the step of: 

defining at least one system output based on the speech input and a user 
model which includes a likely input modality for a current prompt, wherein the system 
output is based on the likely input modality. 

11. (Previously Presented) The process of claim 8, wherein the step of defining comprises 
the step of: 

defining at least one system output based on the speech input and a user 
model which includes a familiarity level, wherein the system output is based on the 
familiarity level. 

12. (Previously presented) The process of claim 8, further comprising the steps of: 

receiving a user face image; and 

determining a degree of despair based on the user face image; wherein the 
step of defining comprises the step of: 

defining at least one system output based on the degree of despair. 
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